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A NOVEL METHOD OF DIAGNOSING, 
MONITORING, AND STAGING COLON CANCER 

FIELD OF THE INVENTION 

This invention relares, in part, to newly developed 
assays for detecting, diagnosing, monitoring, staging, and 
prognosticating cancers, particularly colon cancer. 

BACKGROUND OF THE INVENTION 

Colon cancer is the second most frequently diagnosed 
malignancy in the United States. Cancer of the 

gastrointestinal tract, especially colon cancer, is a highly 
treatable and often a curable disease when localized to the 
bowel. However, currently colon cancer is the second most 
common cause of cancer death. Surgery is the primary 
treatment and results in cure in approximately 50% of 
patients. Recurrence following surgery is a major problem and 
often is the ultimate cause of death. The prognosis of colon 
cancer is clearly related to the degree of penetration of the 
tumor through the bowel wall and the presence or absence of 
nodal involvement. These two characteristics ,orin the basis 
for all staging systems developed for this disease. Bowel 
obstruction and bowel perforation are indicators of poor 
prognosis. Elevated pretreatment serum levels of 

carcinoembryonic antigen (CEA) and carbohydrate antigen 19-9 
(CA 19-9) also have negative prognostic significance. 

Because of the frequency of the disease (approximately 
160,000 new cases of colon cancer per year), the 
identification of high-risk groups, the demonstrated slow 
growth of primary lesions, the better survival of early-stage 
lesions, and the relative simplicity and accuracy of screenina 
tests, screening for colon cancer should be a part of routine 
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care for all adults starting at age 50, especially those with 
first -degree relatives with colorectal cancer. 

Procedures used for detecting, diagnosing, monitoring, 
staging, and prognosticating colon cancer are of critical 
importance tc the outcome of the patient. For example, 
patients diagnosed with early colon cancer generally have a 
much greater five-year survival rate as compared to the 
survival rate for patients diagnosed with distant metastasized 
colon cancer. Treatment decisions are usually made in 
reference to the older Dukes or the Modified Ast ler -Coller 
(MAC) classification schema for staging. However, new 
diagnostic methods which are more sensitive and specific for 
detecting early colon cancer are clearly needed. 

Further, colon cancer patients must be closely monitored 
following initial therapy and during adjuvant therapy to 
determine response to therapy and to detect persistent or 
recurrent disease of metastasis. Thus, there is clearly a 
need for a colon cancer marker which is more sensitive and 
specific in detecting colon cancer recurrence. 

Another important step in managing colon cancer is to 
determine the stage of the patient's disease. Stage 
determination has potential prognostic value and provides 
criteria for designing optimal therapy. Currently, 
pathological staging of colon cancer is preferable over 
clinical staging as pathological staging provides a more 
accurate prognosis. However, clinical staging would be 
preferred were the method of clinical staging at least as 
accurate as pathological staging because it does not depend 
on an invasive procedure to obtain tissue for pathological 
evaluation. Staging of colon cancer would be improved by 
detecting new markers in cells, tissues, or bodily fluids 
which could differentiate between different stages of 
invasion . 

Ir. the present invention, methods are provided for 
detecting, diagnosing, monitoring, staging, and 
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prognosticating colon cancers, particularly colon, stomach, 
and small intestine cancer, via nine (9) Colon Specific Genes 
(CSGs). The nine CSGs refer, among other things, to native 
proteins expressed by the genes comprising the polynucleotide 
sequences of any of SEQ ID NO:l, 2, 3, 4, 5, 6, 7, 8 or 9. 
In the alternative, what is meant by the nine CSGs as used 
herein, means the native mRNAs encoded by the genes comprising 
any of the polynucleotide sequences of SEQ ID NO:l, 2, 3, 4, 
5, 6, 7, 8 or 9 or levels of the genes comprising any of the 
polynucleotide sequences of SEQ ID NO:l, 2, 3, 4, 5, 6, 7, 8 
or 9. 

Other objects, features, advantages and aspects of the 
present invention will become apparent to those of skill in 
the art from the following description. It should be 
M understood, however, that the following description and the 

specific examples, while indicating preferred embodiments of 
the invention, are given by way of illustration only. Various 
changes and modifications within the spirit and scope of the 
disclosed invention will become readily apparent to those 
skilled in the art from reading the following description and 
from reading the other parts of the present disclosure. 

SUMMARY OF THE INVENTION 

Toward these ends, and others, it is an object of the 
present invention to provide a method for diagnosing the 

?3 presence of colon cancer in a patient which comprises 

measuring levels of CSG in a sample of cells, tissue or bodily 

... fluid from the patient and comparing the measured levels of 

CSG with levels of CSG in preferably the same cells, tissue, 
or bodily fluid type of a control, wherein an increase in the 
measured CSG levels in the patient versus levels of CSG in the 
control is associated with colon cancer. 

Another object of the present invention is to provide 
a method of diagnosing metastatic colon cancer in a patient 
which comprises measuring CSG levels in a sample of cells, 



BNSDOCID: <WO ... . . 99601 61 A1 I > 



WO 99/60161 



PCT/US99/10498 



- 4 - 

tissue, or bodily fluid from the patient and comparing the 
measured CSG levels with levels of CSG in preferably the same 
cells, tissue, cr bodily fluid type of a control, wherein an 
increase in measured CSG levels in the patient versus levels 
of CSG in the control is associated with a cancer which has 
met astas i zed . 

Another object of the present invention is to provide 
a method of staging colon cancer in a patient which comprises 
identifying a patient having colon cancer, measuring levels 
of CSG in a sample of cells, tissues, or bodily fluid obtained 
from the patient, and comparing the measured CSG levels with 
levels of CSG in preferably the same cells, tissue or bodily 
fluid type of a control. An increase in measured CSG levels 
in the patient versus CSG levels in the control can be 
associated with a cancer which is progressing while a decrease 
or equivalent level of CSG measured in the patient versus the 
control can be associated with a cancer which is regressing 
or in remission. 

Another object of the present invention is to provide 
a method of monitoring colon cancer in a patient for the onset 
of metastasis. The method comprises identifying a patient- 
having colon cancer that is not known to have metastasized, 
periodically measuring levels of CSG in a sample of cells, 
tissues, or bodily fluid obtained from the patient, and 
comparing the measured CSG levels with levels of CSG m 
preferably the same cells, tissue, or bodily fluid type of a 
control, wherein an increase in measured CSG levels versus 
control CSG levels is associated with a cancer which has 
metastasized . 

Yet another object of the present invention is to 
provide a method of monitoring the change in stage of colon 
cancer in a patient which comprises identifying a patient 
having colon cancer, periodically measuring levels of CSG in 
a sample ceils, tissue, or bodily fluid obtained from the 
patient, and comparing the measured CSG levels with levels of 
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CSG in preferably the same cells, tissues, or bodily fluid 
type of a control wherein an increase in measured CSG levels 
versus the control CSG levels is associated with a cancer 
which is progressing and a decrease in the measured CSG levels 
versus the control CSG levels is associated with a cancer 
which is regressing or in remission. 

Other objects, features, advantages and aspects of the 
present invention will become apparent to those of skill in 
the art from the followi^o: description. It should be 
understood, however, that the following description and the 
specific examples, while indicating preferred embodiments of 
the invention, are given by way of illustration only. Various 
changes and modifications within the spirit and scope of the 
disclosed invention will become readily apparent to those 
skilled in the art from reading the following description and 
from reading the other parts of the present disclosure. 

DESCRIPTION OF THE INVENTION 

The present invention relates to diagnostic assays and 
methods, both quantitative and qualitative for detecting, 
diagnosing, monitoring, staging, and prognosticating cancers 
by comparing levels of CSG with those of CSG in a normal human 
control. What is meant by "levels of CSG" as used herein, 
means levels of the native protein expressed by the genes 
comprising the polynucleotide sequence of any of SEQ ID NO: 
1, 2, 3, 4, 5, 6, 7, 8 or 9. In the alternative, what is 
meant by "levels of CSC" as used herein, means levels of the 
native mRNA encoded by any of the genes comprising any of the 
polynucleotide sequences of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 
8 or 9 or levels of the gene comprising any of the 
polynucleotide sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8 
or 9. Such levels are preferably measured in at least one of, 
cells, tissues and/or bodily fluids, including determination 
of normal and abnormal levels. Thus, for instance, a 
diagnostic assay in accordance with the invention for 
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diagnosing over-expression of any one of the CSG proteins 
compared to normal control bodily fluids, cells, or tissue 
samples may be used to diagnose the presence of cancers, 
including colon cancer. Any of the nine CSGs may be measured 
alone in the methods of the invention, or all together or any 
combination of the nine. 

By "control" it is meant a human patient without cancer 
and/or ncn cancerous samples from the patient, also referred 
to herein as a normal human control; in the methods for 
diagnosing or monitoring for metastasis, control may also 
include samples from a human patienr. that is determined by 
reliable methods to have colon cancer which has no- 
met as t as i zed . 

All the methods of the present invention may optionally 
include measuring the levels of other cancer markers as well 
as CSG . Other cancer markers, in addition to CSG, useful in 
the present invention will depend on the cancer being tested 
and are known to those of skill in the art. 
Diagnostic Assays 

The present invention provides methods for diagnosing 
the presence of colon cancer by analyzing for changes in 
levels of CSG in cells, tissues or bodily fluids compared with 
levels of CSG in cells, tissues or bodily fluids of preferably 
the same type from a normal human control, wherein an increase 
in levels of CSG in the patient versus the normal human 
control is associated with the presence of colon cancer. 
Without limiting the instant invention, typically, for a 
quantitative diagnostic assay a positive result indicating the 
patient being rested has cancer is one in which cells, 
tissues, or bodily fluid levels of the cancer marker, such as 
CSG, are at least two times higher, and most preferably are 
at least five times higher, than in preferably the same cells, 
tissues, or bodily fluid of a normal human control. 

The present invention also provides a method of 
diagnosing metastatic colon cancer in a patient having colon 
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cancer which has not yet metastasized for the onset of 
metastasis. In the method of the present invention, a human 
cancer patient suspected of having colon cancer which may have 
metastasized (but which was not previously known to have 
metastasized) is identified. This is accomplished by a 
variety of means known to those of skill in the art. For 
example, in the case of colon cancer, patients are typically 
diagnosed with colon cancer following traditional detection 
methods . 

In the present invention, determining the presence of 
CSG level in cells, tissues, or bodily fluid, is particularly 
useful for discriminating between colon cancer which has not 
metastasized and colon cancer which has metastasized. 
Existing techniques have difficulty discriminating between 
colon cancer which has metastasized and colon cancer which has 
not metastasized and proper treatment selection is often 
dependent upon such knowledge. 

In the present invention, the cancer marker levels 
measured in such cells, tissues, or bodily fluid is CSG, and 
are compared with levels of CSG in preferably the same cells, 
tissue, or bodily fluid type of a normal human control. That 
is, if the cancer marker being observed is just CSG in serum, 
this level is preferably compared with the level of CSG in 
serum of a normal human patient. An increase in the CSG in 
the patient versus the normal human control is associated with 
colon cancer which has metastasized. 

Without limiting the instant invention, typically, for 
a quantitative diagnostic assay a positive result indicating 
the cancer in the patient being tested or monitored has 
metastasized is one in which cells, tissues, or bodily fluid 
levels of the cancer marker, such as CSG, are at least two 
times higher, and most preferable are at least five times 
higher, than in preferably the same cells, tissues, or bodily 
fluid of a normal patient. 
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Normal human control as used herein includes a human 
patient without cancer and/or non cancerous samples from the 
patient; in the methods for diagnosing or monitoring for 
metastasis, normal human control may also include samples from 
a human patient that is determined by reliable methods to have 
colon cancer which has not metastasized. 
Staging 

The invention also provides a method of staging colon 
cancer in a human patient. 

The method comprises identifying a human patient having 
such cancer; analyzing a sample of cells, tissues, or bodily 
fluid from such patient for CSG. Then, the method compares 
CSG levels in such ceils, tissues, or bodily fluid with levels 
of CSG in preferably the same cells, tissues, or bodily fluid 
type of a normal human control sample, wherein an increase in 
CSG levels in the patient versus the normal human control is 
associated with a cancer which is progressing and a decrease 
in the levels of CSG is associated with a cancer which is 
regressing or ir. remission. 
Monz toxing 

Further provided is a method of monitoring colon cancer 
in a human having such cancer for the onset of metastasis. 
The method comprises identifying a human patient having such 
cancer that is not known to have metastasized; periodically 
analyzing a sample of cells, tissues, or bodilv fluid from 
such patient for CSG; comparing the CSG levels in such cells, 
tissue, or bodily fluid with levels of CSG in preferably ths 
same cells, tissues, or bodily fluid type of a normal human 
control sample, wherein an increase in CSG levels in the 
patient versus the normal human control is associated with a 
cancer which has metastasized. 

Further provided by this inventions is a method of 
monitoring the change in stage of colon cancer in a human 
having such cancer. The method comprises identifying a human 
patient having such cancer; periodically analyzing a sample 
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of cells, tissues, or bodily fluid from such patient for CSG; 
comparing the CSG levels in such cells, tissue, or bodily 
fluid with levels of CSG in preferably the same cells, 
tissues, or bodily fluid type of a normal human control 
sample, wherein an increase in CSG levels in the patient 
versus the normal human control is associated with a cancer 
which is progressing in stage and a decrease in the levels of 
CSG is associated with a cancer which is regressing in stage 
or in remission. 

Monitoring such patient for onset of metastasis is 
periodic and preferably done on a quarterly basis. However, 
this may be more or less frequent depending on the cancer, the 
particular patient, and the stage of the cancer. 
Assay Techniques 

Assay techniques that can be used to determine levels 
of gene expression, such as CSG of the present invention, in 
a sample derived from a host are well-known to those of skill 
in the art. Such assay methods include radioimmunoassays, 
reverse transcriptase PCR (RT-PCR) assays, 

immunohistochemistry assays, in situ hybridization assays, 
competitive-binding assays, Western Blot analyses and ELISA 
assays. Among these, ELISAs are frequently preferred to 
diagnose a gene's expressed protein in biological fluids. 
An ELISA assay initially comprises preparing an antibody, if 
not readily available from a commercial source, specific to 
CSG, preferably a monoclonal antibody. In addition a reporter 
antibody generally is prepared which binds specifically to 
CSG. The reporter antibody is attached to a detectable 
reagent such as radioactive, fluorescent or enzymatic reagent, 
for example horseradish peroxidase enzyme or alkaline 
phosphatase . 

To carry out the ELISA, antibody specific to CSG is 
incubated on a solid support, e.g., a polystyrene dish, that 
binds the antibody. Any free protein binding sites on the 
dish are then covered by incubating with a non-specific 
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protein such as bovine serum albumin. Next, the sample to be 
analyzed is incubated in the dish, during which time CSG binds 
to the specific antibody attached to the polystyrene dish. 
Unbound sample is washed uul with buffer. A reporter antibody 
specifically directed to CSG and linked to horseradish 
peroxidase is placed in the dish resulting in binding of the 
reporter antibody to any monoclonal antibody bound to CSG. 
Unattached reporter antibody is then washed out. Reagents for 
peroxidase activity, including a coiorimetric substrate are 
then added to the dish. Immobilized peroxidase, linked to CSG 
antibodies, produces a colored reaction product. The amount 
of color developed in a given time period is proportional to 
the amour.: of CSG protein present in the sample. Quantitative 
results typically are obtained by reference to a standard 
curve . 

A competition assay may be employed wherein antibodies 
specific to CSG attached to a solid support and labeled CSG 
and a sample derived from the host are passed over the solid 
support and the amount of label detected attached to the solid 
support can be correlated to a quantity of CSG in the sample. 
Nucleic acid methods may be used to detect CSG mRNA as a 
marker for colon cancer. Polymerase chain reaction (PCR) and 
other nucleic acid methods, such as ligase chain reaction 
(LCR) ana nucleic acid sequence based amplification (NASABA) , 
can be used to detect malignant cells for diagnosis and 
monitoring of various malignancies. For example, reverse- 
transcriptase PCR (RT-PCR) is a powerful technique which can 
be used to detect the presence of a specific mRNA population 
in a complex mixture of thousands of other mRNA species. In 
RT-PCR, an mRNA species is first reverse transcribed to 
complementary DNA (cDNA) with use of the enzyme reverse 
transcriptase; the cDNA is then amplified as in a standard PCR 
reaction. RT-PCR can thus reveal by amplification the 
presence of a single species of mRNA. Accordingly, if the 
mRNA is highly specific for the cell that produces it, RT-PCR 
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can be used to identify the presence of a- specific type of 
cell . 

Hybridization to clones or oligonucleotides arrayed on 
a solid support (i.e., gridding) can be used to both detect 
the expression of and quantitate the level of expression of 
;|f that gene. In this approach, a cDNA encoding the CSG gene is 

fixed to a substrate. The substrate may be of any suitable 
type including but not limited to glass, nitrocellulose, nylon 
or plastic. At least a portion of the DNA encoding the CSG 
gene is attached to the substrate and then incubated with the 
analyte, which may be RNA or a complementary DNA (cDNA) copy 
of the RNA, isolated from the tissue of interest. 
Hybridization between the substrate bound DNA and the analyte 
can be detected and quantitated by several means including but 
not limited to radioactive labeling or fluorescence labeling 
of the analyte or a secondary molecule designed to detect the 
hybrid. Quantitation of the level of gene expression can be 
done by comparison of the intensity of the signal from the 
analyte compared with that determined from known standards. 
The standards can be obtained by in vitro transcription of the 
target gene, quantitating the yield, and then using that 
material to generate a standard curve. 

The above tests can be carried out on samples derived from a 
variety of patients' cells, bodily fluids and/or tissue 
extracts (homogenates or solubilized tissue) such as from 
tissue biopsy and autopsy material. Bodily fluids useful in 
the present invention include blood, urine, saliva, or any 
other bodily secretion or derivative thereof. Blood can 
include whole blood, plasma, serum, or any derivative of 
blood. 



EXAMPLES 

The present invention is further described by the 
following examples. These examples are provided solely to 
illustrate the invention by reference to specific embodiments. 
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These exemplifications, while illustrating certain specific 
aspects of the invention, do not portray the limitations or 
circumscribe the scope of the disclosed invention. 



I 
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Example 1 : CSGs 

Searches were carried out and CSGs identified using the 
following Search Tools as part of the LIFESEQ® database 
available from Incyte Pharmaceuticals, Palo Alto, CA: 

1. Library Comparison (compares one library to one 
other library) allows the identification of clones expressed 
in tumor and absent or expressed at a lower level in normal 
tissue . 

2. Subsetting is similar to library comparison but 
allows the identification of clones expressed in a pool of 
libraries and absent or expressed at a lower level in a second 
pool of libraries. 

3. Transcript Imaging lists all of the clones in a 
single library or a pool of libraries based on abundance. 
Individual clones can then be examined using Electronic 
Northerns to determine the tissue sources of their component 
ESTs. 

4. Protein Function: Incyte has identified subsets of 
ESTs with a potential protein function based on homologies to 
known proteins. Some examples in this database include 
Transcription Factors and Proteases. We identified some leads 
by searching in this database for clones whose component ESTs 
showed disease specificity. 

Electronic subtractions, transcript imaging and protein 
function searches were used to identify clones, whose 
component ESTs were exclusively or more frequently found in 
libraries from specific tumors. Individual candidate clones 
were examined in detail by checking where each EST originated. 
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SEQ ID 


Clone ID # 


Gene ID # 




NO: 








1 


238330 


242807 


Transcript Imaging 


2 


1285234 


239588 


Subsetting 


3 


134 1701 


29634 


Transcript Imaging 


4 


816257 


233421 


Subsetting 


5 


775133 


245080 


Subsetting 


6 


1335450 


245811 


Subsetting 


7 


2348122 


233711 


Transcript Imaging 


8 


3228674 


230273 


Subsetting 


9 


1632174 


229022 


Transcript Imaging 



The following example was carried out using standard 
techniques, which are well known and routine to those of skill 
in the art, except where otherwise described in detail. 
Routine molecular biology techniques of the following example 
can be carried out as described in standard laboratory 
manuals, such as Sambrook et al . , MOLECULAR CLONING: A 
LABORATORY MANUAL, 2nd Ed.; Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y. (1989) . 

Example 2 : Relative Quantitation of CSG Gene Expression 

Real-Time quantitative PCR with fluorescent Taqman 
probes is a quantitation detection system utilizing the 5'- 
3' nuclease activity of Taq DNA polymerase. The method uses 
an internal fluorescent oligonucleotide probe (Taqman) labeled 
with a 5' reporter dye and a downstream, 3' quencher dye. 
During PCR, the 5' -3' nuclease activity of Taq DNA polymerase 
releases the reporter, whose fluorescence can then be detected 
by the laser detector of the Model 7700 Sequence Detection 
System (PE Applied Biosystems, Foster City, CA, USA) . 
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Amplification of an endogenous control is used to 
standardize the amount of sample RNA added to the reaction and 
normalize for Reverse Transcriptase (RT) efficiency. Either 
cyclophilin, glyceraldehyde-3-phosphate dehydrogenase (GAPDH) 
or 18S ribosomal RNA (rRNA) is used as this endogenous 
control. To calculate relative quantitation between all the 
samples studied, the target RNA levels for one sample are used 
as the basis for comparative results (calibrator) . 
Quantitation relative to the "calibrator" can be obtained 
using the standard curve method or the comparative method 
(User Bulletin #2: ABI PRISM 7700 Sequence Detection System). 

To evaluate the tissue distribution, and the level of 
CSGs in normal and tumor tissue, total RNA was extracted from 
normal tissues, tumor tissues, and from tumors and the 
^ corresponding matched normal tissues. Subsequently, first 

strand cDNA was prepared with reverse transcriptase and the 
polymerase chain reaction was done using primers and Taqman 
probe specific to the CSG. The results were analyzed using 
the ABI PRISM 7700 Sequence Detector. The absolute numbers 
are relative levels of expression of the CSG compared to the 
calibrator. 
Comparative Examples 

Similar mRNA expression analysis for genes coding for 
the diagnostic markers PSA (Prostate Specific Antigen) and 
PLA2 (Phospholipase A2 ) was performed for comparison. PSA xs 
currently the only cancer screening marker available in 
clinical laboratories. When the panel of nonual pooled 
tissues was analyzed, PSA was expressed at very high levels 
in prostate, with a very low expression in breast and testis. 
After analysis of more than 55 matching samples from 14 
different tissues, the data corroborated the tissue 
specificity seen with normal tissue samples. PSA expression 
was compared in cancer and normal adjacent tissue for 12 
matching samples of prostate tissue. The relative levels of 
PSA were higher in 10 cancer samples (83%). Clinical data 
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recently obtained support the utilization of PLA2 as a staging 
marker for late stages of prostate cancer. mRNA expression 
data described herein showed overexpression of the mRNA in 8 
out of the 12 prostate matching samples analyzed (66%). PLA2 
had high levels of mRNA expression in small intestine, 
prostate, liver, and pancreas. 



Measurement of SEQ ID NO: 3/ Clone ID 1341701; Gene ID 2 9634 
(Clnl06) 

Absolute numbers are depicted in Table 2 as relative 
levels of expression of Clnl06 ( SEQ ID N0:3) in 12 normal 
different tissues. All the values are compared to normal 
testis (calibrator) . These RNA samples are commercially 
available pools, originated by pooling samples of a particular 
tissue from different individuals. 

Table 2: Relative levels of Clnl06 Expression in Pooled 



Samples 



Tissue 


NORMAL 


Colon-Ascending 


110 


Endometrium 


0 


Kidney 


0 


Liver 


0 


Ovary 


0 


Pancreas 


0 


Prostate 


16 


Small Intestine 


0 


Spleen 


0 


Stomach 


0 


Testis 


1 


Uterus 


0 



The relative levels of expression in Table 2 show for 
the CSG Clnl06 (SEQ ID NO: 3), mRNA expression is more than 6 
fold higher in the pool of normal ascending colon (110) 
compared with prostate (16) . Testis, the calibrator, with a 
relative expression level of 1, is the only other tissue 
expressing the mRNA for Clnl06 (SEQ ID NO: 3). These results 
demonstrate that mRNA expression of this CSG is highly 
specific for colon. 
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The absolute numbers in Table 2 were obtained analyzing 
pools of samples of a particular tissue from different 
individuals. They can not be compared to the absolute numbers 
originated from RNA obtained from tissue samples of a single 
individual in Table 3. 

The absolute numbers in Table 3 are relative levels of 
expression of Clnl06 (SEQ ID NO: 3) in 57 pairs of matching 
samples . All the values are compared to normal testis 
(calibrator) . A matching pair is formed by mRNA from the 
cancer sample for a particular tissue and mRNA from the normal 
adjacent sample for that same tissue from the same individual. 
Table 3: Relative levels of Clnl06 Expression in Individual 



Samples 



Sample ID 


Tissue 


Cancer 


Matching 

Normal 
Adjacent 


Sto AC93 


Stomach 1 


4 


96 


Sto AC99 


Stomach 2 


0.4 


0 . 5 


Sml 21XA 


Small Intestine 1 


0 


0 


Sml H8 9 


Small Intestine 2 


0.93 


1.28 


Cln B56 


Colon-Cecum (A) 1 


317 


101 


Cln AS45 


Colon-Ascending (A) 2 


316.3 


146.5 


Cln CM67 


Colon-Cecum (B) 3 


481.0 


217 . 5 


Cln AS67 


Colon-Ascending (B) 4 


858 .1 


220. 6 


Cln AS43 


Colon-Ascending (C) 5 


1370 


98 


Cln AS46 


Colon-Ascending (C) 6 


3051 


375 


Cln AS98 


Colon-Ascending (C) 7 


26 


42 


Cln AS89 


Colon-Ascending (D) 8 


524 . 6 


11 . 0 


Cln TX01 


Colon-Transverse (B) 9 


2886.3 


1992.0 


Cln TX89 


Colon-Transverse (B) 10 


146.0 


35. 9 


Cln TX67 


Colon-Transverse (C) 11 


2.9 


421.7 


Cln MT38 


Colon-Splenic 
Flexture (M) 12 


1681 


187 
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Cln SG89 


Colon-Sigmoid (B) 13 


1063. 8 


31 . 1 


Cln SG67 


Colon-Sigmoid (C) 14 


8.5 


9.4 


Cln SG33 


Colon-Sigmoid (C) 15 


264 


549 


Cln SG45 


Colon-Sigmoid ( D) 16 


580. 0 


114 . 6 


Cln B34 


Colon-Rectosigmoid (A) 17 


97 


244 


Cln CXGA 


Colon-Rectum (A) 18 


45. 1 


273.4 


Cln RC67 


Colon-Rectum (B) 19 


2.7 


20.0 


Cln C9XR 


Colon-Rectosigmoid (C ) 20 


609 


460 


Cln RS45 


Colon-Rectosigmoid (C) 21 


472. 8 


14 4.0 


Cln RC01 


Colon-Rectum (C) 22 


568 


129 


Cln RC89 


Colon-Rectum (D) 23 


4 . 6 


322 . 91 


Bid 46XK 


Bladder 1 


0.2 


0 


Bid 66X 


Bladder 2 


1 


1 


Bid 32XK 


Bladder 3 


0.0 


0.0 


Kid 126XD 


Kidney 1 


0 


0 


Kid 12XD 


Kidney 2 


0 


0 


Kid 5XD 


Kidney 3 


0.0 


1.0 


Kid 6XD 


Kidney 4 


0.0 


0.0 


Kid 106XD 


Kidney 5 


0.4 


0.0 


Liv 42X 


Liver 1 


0.0 


0. 0 


Liv 15XA 


Liver 2 


0.0 


0.0 


Liv 94XA 


Liver 3 


0.0 


0.0 


Lng AC 6 9 


Lung 1 


2 


0 


Lng BR94 


Lung 2 


0 


0 


Lng 47XQ 


Lung 3 


0 


0 


Mam 59X 


Mammary Gland 1 


0 


0 


Mam B011X 


Mammary Gland 2 


0 


0 


Mam A0 6X 


Mammary Gland 3 


0 


0 


Ovr 103X 


Ovary 1 


0.04 


2.08 


Ovr 13 OX 


Ovary 2 


0.1 


2.76 
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Pan 71XL 


Pancreas 1 


4 .08 


0.1 


Pan 82XP 


Pancreas 2 


0 


0 


Pro 12B 


Prostate 1 


0.3 


0 


Pro 23B 


Prostate 2 


3 


4 


Pro 13XB 


Prostate 3 


2 


7 


Pro 34B 


Prostate 4 


0. 54 


4 .01 


Pro 20XB 


Prostate 5 


4 . 8 


4 . 3 


Pro 65XB 


Prostate 6 


0 . 7 


1. 3 


Tst 39X 


Testis 1 


2 .78 


0 


End 8XA 


Endometrium 1 


0 


0.2 


Utr 85XU 


Uterus 1 


1.26 


0 



0= Negative 



When matching samples were analyzed, the higher levels 
of expression were in the colon, showing a high degree of 
tissue specificity for this tissue. These results confirm the 
tissue specificity results obtained with the panel of normal 
pooled samples (Table 2) , Furthermore, the level of mRNA 
expression in cancer samples and the isogenic normal adjacent 
tissue from the same individual were compared. This 
comparison provides an indication of specificity for the 
cancer stage (e.g. higher levels of mRNA expression in the 
cancer sample compared to the normal adjacent) . Table 3 shows 
overexpression of Clnl06 (SEQ ID NO: 3) in 15 colon cancer 
tissues compared with their respective normal adjacent (colon 
samples 'il, 2, 3, 4, 5, 6, 8, 9, 10, 12, 13, 16, 20, 21, and 
22) . There is overexpression in the cancer tissue for 65% of 
the colon matching samples tested (total of 23 colon matching 
samples). The matching sample Pan 71XL is a secondary cancer 
in pancreas, the primary cancer in that individual was a 
duodenal cancer. 

Altogether, the high level of tissue specificity, plus 
the mRNA overexpression in 65% of the colon matching samples 



BNSDOCID: <WO 9960161A1 I . > 



WO 99/60161 PCT/US99/10498 

- 20 - 

tested are demonstrative of CSG Clnl06 (SEQ ID NO : 3 ) being a 
diagnostic marker for colon cancer. 

Measurement of SEQ ID NO: 4; Clone ID 816257; Gene ID 406452 
(Clnl07) 

Absolute numbers as depicted in Table 4 are relative 
levels of expression of CSG Clnl07 (SEQ ID N0:4) in 12 normal 
different tissues. All the values are compared to normal 
small intestine (calibrator) . These RNA samples are 

commercially available pools, originated by pooling samples 
of a particular tissue from different individuals. 
Table 4: Relative levels of Clnl07 Expression in Pooled 



Samples 



Tissue 


NORMAL 


Colon-Ascending 


3.2 


Endometrium 


0 


Kidney 


0.2 


Liver 


0 


Ovary 


0 


Pancreas 


0 


Prostate 


0 . 1 


Small Intestine 


1 


Spleen 


0 


Stomach 


0.3 


Testis 


0 


Uterus 


0 



The relative levels of expression in Table 4 show than 
mRNA expression of the CSG Clnl07 (SEQ ID NO: 4) is more than 
10 fold higher in the pool of normal ascending colon (3.2), 
five fold higher in small intestine (1), and 1.5 fold higher 
in stomach (0.3), compared with the next higher expressor (0.2 
for kidney) . Seven of the pooled tissues samples analyzed 
were negative and prostate showed a relative expression of 0.1 
for Clnl07 (SEQ ID NO : 4 ) . These results demonstrate that: 
Clnl07 mRNA expression is highly specific for colon, small 
intestine, and in a lower degree for stomach. 

The absolute numbers in Table 4 were obtained analyzing 
pools of samples of a particular tissue from different 
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individuals. They can not be compared to the absolute numbers 
originated from RNA obtained from tissue samples of a single 
individual in Table 5. 

The absolute numbers in Table 5 are relative levels of 
expression of Cinl07 (SEQ ID NO: 4) in 57 pairs of matching 
samples. All the values are compared to normal small 
intestine (calibrator) . A matching pair is formed by mRNA 
from the cancer sample for a particular tissue and mRNA from 
the normal adjacent sample for that same tissue from the same 
individual . 

Table 5 : Relative levels of Clnl07 Expression in Individual 



Samples 



Sample ID 


Tissue 


Cancer 


Matching 

Normal 
Adjacent 


Sto AC93 


Stomach 1 


8.9 


13. 4 


Sto AC99 


Stomach 2 


6.0 


0.9 


Sml 21XA 


Small Intestine 1 


1.07 


1 .42 


Sml H89 


Small Intestine 2 


0. 97 


4 . 13 


Cln B56 


Col on -Cecum (A) 1 


2 


16 


Cln AS45 


Colon-Ascending (A) 2 


0.7 


2 . 1 


Cln CM67 


Colon-Cecum (B) 3 


1.6 


2. 1 


Cln AS67 


Colon-Ascending (B) 4 


1.2 


6.2 


Cln AS43 


Colon-Ascending (C) 5 


13.5 


0. 5 


Cln AS46 


Colon-Ascending (C) 6 


9.7 


23. 6 


Cln AS98 


Colon-Ascending (C) 7 


28.1 


1 . 4 


Cln AS89 


Colon-Ascending ( D) 8 


0.9 


3. 1 


Cln TX01 


Colon-Transverse (B) 9 


3.0 


10.6 


Cln TX89 


Colon-Transverse (B) 10 


4 . 5 


0.6 


Cln TX67 


Colon-Transverse (C) 11 


3 . 6 


3.4 


Cln MT38 


Colon-Splenic 
Flexture (M) 12 


4 . 0 


2.6 


Cln SG89 


Colon-Sigmoid (B) 13 


4 . 7 


0. 9 
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Cln SG67 


Colon-Sigmoid (C) 14 


1 . 0 


1-3 


Cln SG33 


Colon-Sigmoid (C) 15 


14.2 


7 . 6 


Cln SG45 


Colon-Sigmoid (D) 16 


4 . 8 


6.0 


Cln B34 


Colon-Rectosigmoid (A) 17 


3 


2 


Cln CXGA 


Colon-Rectum (A) 18 


4 . 4 


1 . 9 


Cln RC67 


Colon-Rectum (B) 19 


0. 1 


0 . 4 


Cln C9XR 


Colon-Rectosigmoid (C) 20 


5 


3 


CI" RS45 


Colon-Rectosigmoid (C) 21 


11.4 


4 . 6 


Cln RC01 


Colon-Rectum (C) 22 


1.8 


2 . 3 


Cln RC89 


Colon-Rectum(D) 23 


0.1 


5 .35 


Bid 4 6XK 


Bladder 1 


0.2 


0 


Bid 66X 


Bladder 2 


1 


1 


Bid 32XK 


Bladder 3 


0 . 1 


0 . 1 


Kid 126XD 


Kidney 1 


0 


0. 02 


Kid 12XD 


Kidney 2 


0.1 


0 . 2 


Kid 5XD 


Kidney 3 


0.3 


0 . 0 


Kid 6XD 


Kidney 4 


0.1 


0. 1 


Kid 106XD 


Kidney 5 


0.0 


0 . 1 


Liv 42X 


Liver 1 


7 . 9 


0 . 002 


Liv 15XA 


Liver 2 


0.0 


0. 0 


Liv 94XA 


Liver 3 


0.0 


0 . 0 


Lng AC69 


Lung 1 


1.6 


0 . 2 


Lng BR94 


Lung 2 


0.4 


0 


Lng 47XQ 


Lung 3 


0.78 


0 . 2 


Mam 5 9X 


Mammary Gland 1 


0. 05 


0.3. 


Mam B011X 


Mammary Gland 2 


0.01 


0 . 004 


Mam A06X 


Mammary Gland 3 


0.22 


0 


Ovr 103X 


Ovary 1 


0.01 


0.01 


Ovr 130X 


Ovary 2 


0.09 


0. 1 


Pan 71XL 


Pancreas 1 


2.51 


2.81 
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Pan 82XP 


Pancreas 2 


0 


0. 62 


Pro 12B 


Prostate 1 


0 . 3 


0.1 


Pro 23B 


Prostate 2 


0.3 


0.2 


Pro 13XB 


Prostate 3 


0 


0 


Pro 34B 


Prostate 4 


0 . 04 


0.22 


Pro 20XB 


Prostate 5 


0.4 


0. 1 


Pro 65XB 


Prostate 6 


0.0 


0. 1 


Tst 39X 


Testis 1 


0 . 02 


0.01 


End 8XA 


Endometrium 1 


0 . 01 


0.5 


Utr 85XU 


Uterus 1 


0.03 


0 



0= Negative 

When matching samples were analyzed, the higher levels 
of expression were in colon, stomach, and small intestine, 
showing a high degree of tissue specificity for colon tissues. 
These results confirm the tissue specificity results obtained 
with normal pooled samples (Table 4) . Furthermore, the level 
of mRNA expression in cancer samples and the isogenic normal 
adjacent tissue from the same individual were compared. This 
comparison provides an indication of specificity for the 
cancer stage {e.g. higher levels of mRNA expression in the 
cancer sample compared to the normal adjacent) . Table 5 shows 
overexpression of Clnl07 (SEQ ID NO : 4 ) in 11 colon cancer 
tissues compared with their respective normal adjacent (colon 
samples #5, 7, 10, 11, 12, 13, 15, 17, 18, 20, and 21). There 
is overexpression in the cancer tissue for 48% of the colon 
matching samples tested (total of 23 colon matching samples) . 
The matching sample Pan 71XL is a secondary cancer in 
pancreas, the primary cancer in that individual was a duodenal 
cancer. 

Altogether, the high level of tissue specificity, plus 
the mRNA overexpression in almost half of the colon, stomach, 
and small intestine matching samples tested are demonstrative 
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of CSG Clnl07 (SEQ ID NO: 4) being a diagnostic marker for 
colon cancer. 

Measurement of SEQ ID NO: 5; Clone ID 775133; Gene ID 24508 
(Clnl08) 

The absolute numbers shown in Table 6 are relative 
levels of expression of CSG ClnlOS (SEQ ID NO: 5) in 12 normal 
different tissues. All the values are compared to normal 
small intestine (calibrator) . These RNA samples are 

commercially available pools, originated by pooling samples 
of a particular tissue from different individuals. 
Table 6: Relative levels of Clnl08 Expression in Pooled 



Samples 



Tissue 


NORMAL 


Co Ion -Ascending 


2846.5 


Endometrium 


1 


Kidney 


5.5 


Liver 


18 . 7 


Ovary 


3.4 


Pancreas 


198 . 1 


Prostate 


1024 


Small Intestine 


810. 8 


Spleen 


32.2 


Stomach 


9981.2 


Testis 


0 


Uterus 


294 . 1 



The relative levels of expression in Table 6 show that mRNA 
expression of CSG Clnl08 (SEQ ID NO: 5) is more than 10 fold 
higher in the pool of normal ascending colon (2846.5) and 
almost ten fold higher in stomach (9981.2), compared to the 
expression level in any other tissue analyzed. These results 
demonstrate that mRNA expression of this CSG is also highly 
specific for colon and stomach. 

The absolute numbers in Table 6 were obtained analyzing 
pools of samples of a particular tissue from different 
individuals. They can not be compared to the absolute numbers 
originated from RNA obtained from tissue samples of a single 
individual in Table 7. 
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The absolute numbers depicted in Table 7 are relative 
levels of expression of Clnl08 (SEQ ID NO:5) in 57 pairs of 
matching samples. All the values are compared to normal small 
intestine (calibrator) . A matching pair is formed by mRNA 
from the cancer sample for a particular tissue and mRNA from 
the normal adjacent sample for that same tissue from the same 
individual . 

Table 7: Relative levels of Clnl08 Expression in Individual 



Samples 



Sample ID 


Tissue 


Cancer 


Matching 

Noinnal 
Adjacent 


Sto AC93 


Stomach 1 


28696 


34842 


Sto AC99 


Stomach 2 


1 1 oZj5 


O C\ o r o 


Sml 21XA 


Small Intestine 1 


1 Q A A 

z y 4 4 




Sml H8 9 


Small Intestine 2 


O A A c: 


J D 1 J5 . / 


Cln Bob 


Colon— Cecum (A) i 


9 7 9 4 9 


c, H O O 1 


Cln AS45 


Colon-Ascending (A) 2 


5827 . 0 


8771 . 0 


Cln CM67 


Colon-Cecum(B) 3 


4251.0 


4684 . 0 


Cln AS67 


Colon-Ascending (B) 4 


564 . 0 


1949. 0 


Cln AS43 


Colon-Ascending (C) 5 


50310 


10949 


Cln AS46 


Colon-Ascending (C) 6 


246044 


120073 


Cln AS98 


Colon-Ascending (C) 7 


40442 


17482 


Cln AS89 


Colon-Ascending (D) 8 


5730.0 


1581 . 0 


Cln TX01 


Colon-Transverse (B) 9 


22281.0 


114784 . 0 


Cln TX89 


Colon-Transverse (B) 10 


11026.0 


1639. 0 


Cln TX67 


Colon-Transverse (C) 11 


17004.0 


11654 .0 


Cln MT38 


Co Ion -Splenic 
Flexture (M) 12 


77589 


31620 


Cln SG89 


Colon-Sigmoid (B) 13 


140339.0 


49617 .0 


Cln SG67 


Colon-Sigmoid (C) 14 


4951.0 


7905. 0 


Cln SG33 


Colon-Sigmoid (C) 15 


60875 


120490 
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Cln SG45 


Colon-Sigmoid (D) 16 


30437. 0 


47267.0 


Cln B34 


Colon-Rectosigmoid (A) 17 


5848 


5861 


Cln CXGA 


Colon-Rectum (? N 1 8 


13877.0 


9787 .0 


Cln RC67 


Colon-Rectum(B) 19 


1703. 0 


26589. 0 


Cln C9XR 


Colon-Rectosigmoid (C) 20 


2458 


19071 


Cln RS45 


Colon-Rectosigmoid (C) 21 


95523 


61939 


Cln RC01 


Colon-Rectum (C) 22 


98891. 0 


80047 . 0 


Cln RC89 


Colon-Rectum (D^ 23 


17.0 


1775 


Bid 4 6XK 


Bladder 1 


0 


8 


Bid 66X 


Bladder 2 


397 


44 


Bid 32XK 


Bladder 3 


0.0 


16. 0 


Kid 126XD 


Kidney 1 


32 


22 


Kid 12XD 


Kidney 2 


6 


0 


Kid 106XD 


Kidney 3 


4 . 0 


33. 0 


Liv 42X 


Liver 1 


4783 


0 


Liv 15XA 


Liver 2 


4 . 0 


10. 0 


Liv 94XA 


Liver 3 


159.0 


21.0 


Lng AC 6 9 


Lung 1 


222 


295 


Lng BR94 


Lung 2 


112 


0 


Lng 4 7XQ 


Lung 3 


30 


69 


Lng AC66 


Lung 4 


29 


137 


Mam 59X 


Mammary Gland 1 


56 


0 


Mam B011X 


Mammary Gland 2 


54 


31 


Mam A0 6X 


Mammary Gland 3 


12 


0 


Ovr 103X 


Ovary 1 


37 


0 


Pan 71XL 


Pancreas 1 


13203 


4163 


Pan 82XP 


Pancreas 2 


39.1 


0 


Pro 12B 


Prostate 1 


386 


88 


Pro 23B 


Prostate 2 


250 


23 


Pro 13XB 


Prostate 3 


92 


731 
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Pro 34B 


Prostate 4 


33.3 


265. 7 


Pro 20XB 


Prostate 5 


454 .6 


1908 . 9 


Pro 65XB 


Prostate 6 


733. 5 


922 . 0 


End 8XA 


Endometrium 1 


5 


92 


Utr 85XU 


Uterus 1 


98 . 9 


21 . 8 


Utr 23XU 


Uterus 2 


35. 3 


0 


Utr 135X0 


Uterus 3 


39.2 


43.8 


Uti 141X0 


Uterus 4 


212.1 


55. 9 



0= Negative 



When matching samples were analyzed, the higher levels 
of expression were in colon and stomach, showing a high degree 
of tissue specificity for these two tissues. These results 
confirm the tissue specificity results obtained with normal 
pooled samples (Table 6) . Furthermore, the level of mRNA 
expression in cancer samples and the isogenic normal adjacent 
tissue from the same individual were compared. This 
comparison provides an indication of specificity for the 
cancer stage (e.g. higher levels of mRNA expression in the 
cancer sample compared to the normal adjacent) . Table 7 shows 
overexpression of CSG Clnl08 (SEQ ID NO: 5) in 13 colon cancer 
tissues compared with their respective normal adjacent (colon 
samples #1, 5, 6, 7, 8, 9, 10, 11, 12, 13, 18, 21, and 22). 
There is overexpression in the cancer tissue for 56% of the 
colon matching samples tested (total of 23 colon matching 
samples) . The matching sample Pan 71XL is a secondary cancer 
in pancreas, the primary cancer in that individual was a 
duodenal cancer. 

Altogether, the high level of tissue specificity, plus 
the mRNA overexpression in more than half of the colon, 
stomach, and small intestine matching samples tested are 
demonstrative of this CSG, Clnl08 (SEQ ID NO:5), also being 
a diagnostic marker for colon cancer. 
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Measurement of SEQ ID NO : 7 ; Clone ID 2348122; Gene ID 23371 
(Clnl09) 

The absolute numbers depicted in Table 8 are relative 
levels of expression of CSG Clnl09 (SEQ ID NO: 7) in 12 normal 
different tissues. All the values are compared to normal 
ovary (calibrator) . These RNA samples are commercially 
available pools, originated by pooling samples of a particular 
tissue from different individuals. 

Table 8: Relative levels of Clnl0 9 Expression in Pooled 



Samples 



Tissue 


NORMAL 


Colon-Ascending 


28.8 


Endometrium 


0.45 


Kidney 


0.41 


Liver 


0.72 


Ovary 


0. 07 


Pancreas 


82. 8 


Prostate 


124.3 


Small Intestine 


626. 4 


Spleen 


1.2 


Stomach 


12 . 05 


Testis 


1.51 


Uterus 


52. 99 



The relative levels of expression in Table 8 show that mRNA 
expression of CSG Clnl09 (SEQ ID NO : 7 ) , is more the i 5 fold 
higher in the pool of normal small intestine (626.4) compared 
to the expression level in any other tissue analyzed. These 
results demonstrate that Clnl09 (SEQ ID NO: 7) mRNA expression 
is highly specific for small intestine 

The absolute numbers in Table 8 were obtained analyzing 
pools of samples of a particular tissue from different 
individuals. They can not be compared to the absolute numbers 
originated from RNA obtained from tissue samples of a single 
individual in Table 9. 

The absolute numbers depicted in Table 9 are relative 
levels of expression of Clnl09 (SEQ ID NO:7) in 53 pairs of 
matching samples. All the values are compared to normal ovary 
(calibrator) . A matching pair is formed by mRNA from the 
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cancer sample for a particular tissue and mRNA from the normal 
adjacent sample for that same tissue from the same individual. 
Table 9: Relative levels of Clnl09 Expression in Individual 



Samples 



Sample ID 


Tissue 


Cancer 


Matching 

Normal 
Ad j acent 


Sto AC93 


Stomach 1 


2 5/4 


I 3 1 U 


Sto AC99 


Stomach 2 


415 3 


r 

D 


Sml 21XA 


Small Intestine 1 


2 667 


13 663.8 


Sml H8 9 


Small Intestine 2 


5 / . o 


y u 4 . z y 


Cln B56 


Colon-Cecuiu (A) 1 


67 94 


o n n 

2 9 9 


Cln AS45 


Colon-Ascending (A) 2 


814 . 6 


105. 8 


Cln CM67 


Colon-Cecum (B) 3 


294 . 6 


36. 1 


Cln AS67 


Colon-Ascending (B) 4 


2.2 


26.3 


Cln AS43 


Colon-Ascending (C) 5 


111 


377 


Cln AS46 


Colon-Ascending (C) 6 


1180 


352 


Cln AS98 


Colon-Ascending (C) 7 


1075 


92 


Cln AS89 


Colon-Ascending (D) 8 


14022. 7 


87.5 


Cln TX01 


Colon-Transverse (B) 9 


1027.6 


282. 1 


Cln TX8 9 


Colon-Transverse (B) 10 


2.5 


23.7 


Cln TX67 


Colon-Transverse (C) 11 


0.1 


72 . 3 


Cln MT38 


Colon- Splenic 
Flexture (M) 12 


372 


88 


Cln SG89 


Colon-Sigmoid (B) 13 


179.2 


33.4 


Cln SG67 


Colon-Sigmoid (C) 14 


85.0 


94 . 7 


Cln SG33 


Colon-Sigmoid (C) 15 


5461 


377 


Cln SG45 


Colon-Sigmoid (D) 16 


762 .7 


15 . 9 


Cln B34 


Colon-Rectosigmoid (A) 17 


460 


1 


Cln RC67 


Colon-Rectum(B) 18 


64 .5 


136.2 


Cln C9XR 


Colon-Rectosigmoid (C) 19 


441 


34 


Cln RS45 


Colon-Rectosigmoid (C) 20 


1931 


195 
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Cln RC01 


Colon-Rectum (C) 21 


72.8 


19. 1 


Cln RC89 


Colon-Rectum (D) 22 


4 . 8 


90. 2 


Bid 46XK 


Bladder 1 


4 


3 


Bid 66X 


Bladder 2 


1 


0 


Bid 32XK 


Bladder 3 


0. 1 


307 . 6 


Kid 126XD 


Kidney 1 


0 


2 


Kid 12XD 


Kidney 2 


3 


16 


Kid 5XD 


Kidney 3 


0.0 


0.3 


Kid 6XD 


Kidney 4 


18 . 5 


1.2 


Liv 42X 


Liver 1 


21 


0 . 03 


Liv 15XA 


Liver 2 


0.5 


0.4 


Liv 94XA 


Liver 3 


0.4 


0.0 


Lng AC69 


Lung 1 


0.1 


0 


Lng BR94 


Lung 2 


3 


0 


Lng 60XL 


Lung 3 


0.1 


0 


Mam 5 9X 


Mammary Gland 1 


0 


4 


Mam B011X 


Mammary Gland 2 


8 


13 


Mam A06X 


Mammary Gland 3 


4 . 7 


9.6 


Pan 71XL 


Pancreas 1 


8902 . 5 


1428 .2 


Pan 82XP 


Pancreas 2 


0.2 


9.3 


Pro 12B 


Prostate 1 


9 


20 


Pro 23B 


Prostate 2 


191 


88 


Pro 13XB 


Prostate 3 


12 


460 


Pro 34B 


Prostate 4 


3.2 


80. 4 


Tst 39X 


Testis 1 


29. 9 


0 


End 8XA 


Endometrium 1 


0.3 


21 


Utr 85XU 


Uterus 1 


244 . 7 


592.2 


Ovr 63A 


Ovary 1 


11. 4 


0 


Ovr A1C 


Ovary 2 


68. 4 


0 



0^ Negative 
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When matching samples were analyzed, the higher levels 
of expression were in small intestine, colon and stomach, 
showing a high degree of tissue specificity for these three 
colon tissues. These results confirm the tissue specificity 
results obtained with normal pooled samples for small 
intestine (Table 8) . Furthermore/ the level of mRNA 
expression in cancer samples and the isogenic normal adjacent 
tissue from the same individual were compared. This 
comparison provides an indication of specificity for the 
cancer stage (e.g. higher levels of mRNA expression in the 
cancer sample compared to the normal adjacent) . Table 9 shows 
overexpression of CSG, Clnl09 (SEQ ID NO: 7) in 15 colon cancer 
tissues compared with their respective normal adjacent (colon 
samples #1, 2, 3, 6, 7, 8, 9, 12, 13, 15, 16, 17, 19, 20, and 
21) . There is overexpression in the cancer tissue for 68% of 
the colon matching samples tested (total of 22 colon matching 
samples) . The matching sample Pan 71XL is a secondary cancer 
in pancreas, the primary cancer in that individual was a 
duodenal cancer. 

Altogether, the high level of tissue specificity, plus 
the mRNA overexpression in more than half of the colon, 
stomach, and small intestine matching samples tested are 
demonstrative of CSG Clnl09 (SEQ ID NO: 7) being a diagnostic 
marker for colon cancer. The amino acid sequence encoded by 
the open reading frame of Clnl09 is depicted in SEQ ID NO: 10. 
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What: is claimed is : 

1 . A method for diagnosing the presence of colon cancer 
in a patient comprising: 

(a) measuring levels of CSG in a sample of cells, tissue 
or bodily fluid obtained from the patient; and 

(b) comparing the measured levels of CSG with levels of 
CSG in a sample of cells, tissue or bodily fluid obtained from 
a control, wherein an increase in measured levels of CSG in 
the patient versus the CSG leve 1 s in the control is associated 
with the presence of colon cancer. 



2. A method of diagnosing metastatic colon cancer in 
a patient comprising: 

(a) measuring levels of CSG in a sample of cells, 
tissue, or bodily fluid obtained from the patient; and 

(b) comparing the measured levels of CSG with levels of 
CSG in a sample of cells, tissue, or bodily fluid obtained 
from a control, wherein an increase in measured CSG levels in 
the patient versus the CSG levels in the control is associated 
with a cancer which has metastasized. 



3. A method of staging colon cancer in a patient 
comprising : 

(a) identifying a patient suffering from colon cancer; 

(b) measuring levels of CSG in a sample of cells, 
tissue, or bodily fluid obtained from the patient; and 

(c) comparing the measured levels of CSG with levels of 
CSG in a sample of cells, tissue, or bodily fluid obtained 
from a control, wherein an increase in the measured levels of 
CSG versus the levels of CSG in the control is associated with 
a cancer which is progressing and a decrease in the measured 
levels of CSG versus the levels of CSG in the control is 
associated with a cancer which is regressing or in remission. 
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4. A method of monitoring colon cancer in a patient 
for the onset of metastasis comprising: 

(a) identifying a patient having colon cancer that is 
not known to have metastasized; 

(b) periodically measuring CSG levels in samples of 
| cells, tissue, or bodily fluid obtained from the patient; and 
^ (c) comparing the periodically measured levels of CSG 

with levels of CSG in cells, tissue, or bodily fluid obtained 
- from a control, wherein an increase in any one of the 

periodically measured levels of CSG in the patient versus the 
levels of CSG in the control is associated with a cancer which 
has metastasized. 

5. A method of monitoring changes in a stage of colon 
II cancer in a patient comprising: 

(a) identifying a patient having colon cancer; 

(b) periodically measuring levels of CSG in samples of 
cells, tissue, or bodily fluid obtained from the patient; and 

(c) comparing the measured levels of CSG with levels of 
CSG in a sample of the same cells, tissue, or bodily fluid of 
a control, wherein an increase in any one of the periodically 
measured levels of CSG versus levels of CSG in the control 
is associated with a cancer which is progressing in stage and 
a decrease in any one of the periodically measured levels of 
CSG versus the levels of CSG in the control is associated with 
a cancer which is regressing in stage or in remission. 

6. The method of claim 1, 2, 3, 4 or 5 wherein the CSG 
comprises SEQ ID NO: 3, 4, 5 or 7 . 
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SEQUENCE LISTING 

<110> Macina, Roberto A. 
Yang, Fei 
Sun, Yongming 

<120> A Novel Method of Diagnosing, Monitoring and Staging 
Colon Cancers 

<130> DEX-0035 

<140> 
<141> 

<150> 60/086,266 
<151> 1998-05-21 

<160> 10 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 487 
<212> DNA 

<213> Homo sapiens 
<400> 1 

tctgcatctg gccctcccag tgcacctgtt caatcccagc ycctccctga cctgtacaaa 60 
tacacctgag gaccggctcg agcccagact tcctgcccct gctctgcact ctcaggtatt 120 
ccctgctctt actccaaaaa gatggaccca ggtccgaagg ggcactgcca ctgtgggggg 180 
catggccatc ctccaggtca ctgcgggcga acccctggcc atggcccagg gccctgcggg 240 
ccaccccctg gccatggccc agggccctgc gggcaacccc ctggccatgg cccagggccc 300 
tgcgggcctc cccctggcca tggcccaggt cacccacccc ctggtccaca tcactgagga 360 
agtagaagaa aacaggacac aagatggcaa gcctgagaga attgcccagc tgacctggaa 420 
tgaggcctaa accacaatct tctcttccta ataaacagcc tcctagaggc cacattctat 480 
tctttaa 487 

<210> 2 

<211> 739 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> (693) 

<220> 

<221> unsure 
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<222> (696; . . (699) 
<220> 

<221> unsure 
<222> (703; . . (70S; 

<220> 

<221> unsure 
<222> (708; 

<220> 

<221> unsure 
<222> (710; . . ■: 716; 

<220> 

<221> unsure 
<222> (71&; 

<220> 

<221> unsure 
<222> (723; , . (726} 

<220> 

<221> unsure 
<222> (728) 

<220> 

<221> unsure 
<222> (732) 

<220> 

<221> unsure 
<222> (737; 

<400> 2 

tctgaaacrg icaqttccsc cagcactgct tggatactgg taaccttcca gggggctgct 60 

ttgcatctga aactgtcagc cccagaatgt tgacagtcgc tctcctagcc cttctctgtc 120 

cctcagccrc tggcaatgcc attcaggcca ggtcttcctc ctaragtgga gagtatggaa 180 

gtggtggtgg --aagcgarzc tctcattctg gcaaccagtt ggacggcccc atcaccgccc 240 

tccggcrccg actcaacaca tactacatcg taggtctrca gg^gcgctat: cgcaaggtgz 300 

ggagcgacta rgrgggtgg-: cgcaacggag acctggagga gatctttctg caccctgggg 360 

aatcagtga^ ccaggt::c: gggaagtaca agtggtacct gaagaagctg gtatttgtga 420 

cagacaaggg ccgctatc:g tcttttggga aagacagtgg cacaagtrtc aaigccgicc 480 

ccttgcaccc caacaccgrg ctccgcttca tcagtggccg gtctggttct ctcatcgatg 540 

ccattggcct gcactggga- gtttacccca ctagctgcag cagatgctga gcctcctctc 600 

cttggcaggg gcacrgtga~ gaggagtaag aactccctra tcacraaccc ccaiccaaat 66C 

ggctcaataa aaaaatatgg ttaaggctaa aanaaaanr.g gannnaanan nnnnnntncs 720 

aannnnant: cncctgnia 739 



WO 99/60161 



PCT/US99/10498 



<210> 3 

<211> 428 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 
<222> (391) 



<400> 3 




















aattgtccgc 


ggt 




aggagagcat 


gaatgagag t 


cat- 


czeqea 


as 


tgtgcaga 


6C 


gtcttttgag 


atgtggcatg 


atcgtgactc 


ccact gt agg 


cac c 


ctsagt 




gaagggca 


I2C 


t ccccctgag 


r.cr 


t ggaa gt 


ggat ccttgc 


accggtcatt. 




aiacct 


gt 


gaa agg at 


ISC 


cctccggtt: 


C cJ. c 




agcagaaggt 


tgtgattacc 


a acq 


r.igtta 




:accca:c 


2 4 C 


caaagttt tg 


gaa 


z Z g ca g 2 


tgaacaagcg 


tggctt cage 


an go 


aagt gg 


gg 


cagtatia:. 


3CC 


ctt tgttaat 


t gc 




tctctctcct 


gggaatggca 


t c c t 


t T. l a C l 




gacctctc 


36: 


ctccagagga 


aga 




ttcattcata 


tncgagcagc 


a egg 


caci t g 


ac 


aaaaaar." 


4 2 


tataaggg 


















4 29 



<210> 4 

<211> 134' 

<212> DNA 

<213> Homo sapiens 



<400> 4 














ggaaaacccc 


tgagcacaaa 


gcaagaggca 


tcgaagcccc 


cteggggatg 


cccgcaagcc 


6C 


aacaggggtg 


tcgtgccgtg 


ggagtact tc 


cgcctgcgt c 


ctct gcggtt 


cagggcccca 


1 2 C 


gacgagcccc 


agcaggccca 


agtcccccat 


gtctggggct 


gggaggtggc 


tggggcccct 


is: 


gcactgaggc 


t gcagaagt z 


ccagtcat ct 


gatctgetgg 


aaagggagag 


ggagagtgtc 


24: 


ctgcgccggg 


ageaagaegt 


ggcagaggag 


eggagaaatg 


ctct cttccc 


agaggtct t c 


3CC 


tccccaacgc 


cagatgagaa 


ct ctgaccag 


aactccagga 


gctcct ccca 


ggcat cegge 


36: 


ateaegggea 


gttactcgg~ 


gt crgagtct 


ccctt ct t ca 


gccccatcca 


cctacactca 


4 z : 


aacgtggcgt 


ggacagzgga 


agat ccagtg 


gacagt get c 


ctcccgggca 


gagaaagaag 


48: 


gagcaatggt 


aegctggcat 


caacccct eg 


gaeggtat ca 


actcagaggt 


cctggaagcc 


54C 


atacgggtga 


cccgt eacaa 


gaacgccatg 


geagageget 


gggaat cccg 


ca tctacgcc 


6c: 


agtgaggagg 


atgactgagc 


ct egggatgg 


ggcgcccacc 


ccctgccctg 


ccctgaccct 


6 6: 


cgtgggaact 


gccaagacca 


tcgccaagcc 


cccaccctag 


gaaatgggtc 


ct aggtccag 


72C 


gatccaagaa 


ccacagczca 


tctgccaaca 


atcccaccat 


gggcacattt 


gggactgt t g 


78: 


ggtttttcgt 


ticcctttct 


at ctt ccttt 


agaaatgttt 


ctgect ttgc 


ggtctaaagc 


S4 : 


ttttggggat 


gaaatgggga 


cccctgctga 


ttctttctgc 


ttctaagact 


ttgccaaatg 


9c: 


ccctgggtct 


aagaaagaa a 


gagacccgct 


cct ccacttt 


caggtgtaat 


ttgcttccgc 


96: 


tagtctgagg 


gcagaggga z 


eggtcaaaga 


gggtggcaca 


gategcagea 


ccttgagggg 


1C20 


ctgegggtet 


gagggaggag 


acactcagct 


cctccctctg 


agaagtccca 


agctgagagg 


i o e o 


ggagacctgc 


ccctttccaa 


ccctgggaaa 


ccat ccagt c 


tgagggagga 


ggecaaact c 


1 1 4 C 


ccagtgctgg 


gggtccctc;: 


gcagccctca 


aaccctt cac 


cttggtgcac 


ccagccacac 


1200 


ctggtggaca 


caaagct czz 


acatcgatag 


gatcccatga 


gqatggtccc 


ct tcacctgg 


126: 


gagaaaagtg 


aCCCSGt ~ ~ a 


ggagctggag 


gggggtcttt 


gtcccccacc 


cccaaactge 
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cctgaaataa acctggagtg agctgcc 1347 



<210> 5 
<211> 1249 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 

<222> (1034 ; . . (1046) 



<400> 5 

ggcagagcct gcgcagggce ggagcagctc gcccactggc ggcccgcaac actccgtcrc 60 
accctctggc cccaczqcaz ctagaggagg gccc;.„Ligtg aggccactac ccctccagca 120 
actgggagg:: gggactc~ca gaagcrgqcc caqggtggtg grcagctggg rcagggacct 180 
aacggcacci ggcrgggacc acctcgccrt ctccatcgaa gcaggggaag tgggagcctc 240 
gagcccTicgg gtggaagctg accccaagcc acccticacc rggacaggar. gagagtgtca 300 
ggtgtgctt c ccc;c^:gac cctcatctti accatagtca cgaca:gca: gimattcga 36C 
agctacatga gc:;cacca; gaaaaccatc cgtctgccac gcrggctggc ctcgcccacc 420 
aaggagatcc aggitaaaaa gtacaagtgt ggcctcatca agccczgccc agccaactac 480 
tttgcgttta aaaictgcaa tggggccgcc aacgtcgtgg gccciactat gtgctttgaa 540 
gaccgcatga tcaigagtcc tgtgaaaaac aatgtgggca gaggcctaaa catcgccctg 600 
gtgaatggaa ccacgggagc tgtgctggga cagaagtcat ttgacatgta ctctggagat 660 
gttatgcacc tagtgaaatt ccttaaagaa attccggggg gtgcactggt gctggtggcc 720 
tcctacgacg atccagggac caaaatgaac gatgaaagca ggaaactctt ctctgacttg 780 
gggagttcct acgcaaaaca actgggcttc cgggacagct gggtcttcat aggagccaaa 840 
gacctcaggg gtaaaagccc ctrtgagcag ttcttaaaga acagcccaga cacaaacaaa 900 
tacgagggat ggccagagct gctggagatg gagggctgca tgcccccgaa gccattttag 960 
ggtggctgtg gc:cticc:c agccaggggc ctgaagaagc tccigcctga cttaggagtc 1020 
agagcccggc aggnnnnnnn nnnnnnnnnn nnnnnntgci gcgtggaagg tgctgcaggt 1080 
ccttgcacgc tgrgt cgcgc ctctcct cce cggaaacaga acccrcccac agcacatcct 1 1 4 C 
acccggaaga ccagccicag agggtcctrc tggaaccagc Lgrcr.gtgga gagaatgggg 1201 
tgctttcgtc aggaactgct gacggctggt cctgaqgaag gacaaactg 124 r 



<210> 6 

<211> 1220 

<212> DNA 

<213> Homo sapiens 



<400> 6 

gctttcrgca ccucart cca catcaggagc 
tccagggccc agzggaggga gggagtgaag 
gagactcaga caa gattaca atgaaccaac 
ccagaggatg gagiacagat gaggctaara 
ctccatctct gcccagaagc tgcaaggaaa 
gcctgtattt tc:ccgcac: gagaatggtg 
ctgggggtgg cggccggacc ctggtggcca 
gcacggtggg ccazcqczcg tccagicagc 



gtttt iggag aaagctgcac tctcttgagc 6C 

gagctcrctg tacccaagga aag~gcagct 120 

tcagctuccr gctgzrtctc aiagcgacca 181 

cttacttcaa ggaatggacc tgttctccgt 240 

tcaaagacga atgrcctagt gcatttgatg 300 

ttatcracca gaccttctgr. gacatgacct 360 

gcgtgcacga gaatgacatg cgzgggaagt 420 

agggcagcaa agcagtctac ccagaggggg 480 
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acggcaact g 


gqccaacz ac 


aacacct tt g 


gatctgeaga 


ggcggccacg 


agcgatgacL 


54 0 


acaagaaccc 


tggctaciac 


gacat ccagg 


ccaaggacct 


gggcat ctgg 


cacgtgccca 


600 


ataagtcccc 


cat gcagcac 


tggagaaaca 


gctccctgct 


gaggtacege 


aeggacactg 


660 


get t cct ccs 


gacact ggga 


cat aat ctgt 


1 1 ggcat eta 


ccagaaa t at 


ccagtgaaa: 


720 


atggagaagg 


aaagt gttgg 


act gacaacg 


geceggt ga t 


cc rtgtggt c 


tatgatttt g 


780 


gcgacgccca 


gaaaacagca 


t cttattact 


caccctatgg 


cca gcgggaa 


1 1 cactgegg 


840 


gat tt gt tea 


gtt cagggt a 


tt t aataacg 


agagagcagc 


caacgccttg 


t gtget ggaa 


900 


tgagggtcac 


eggat gtaac 


actgagcacc 


act gcat tge 


t ggaggagga 


t act tt ccag 


960 


aggecagt ce 


ccagcagt gt 


ggagattttt 


ct ggtt t tga 


1 1 gqa gt gga 


tar gga act c 


1020 


atgttggtta 


cagcagcagc 


cgtgaga taa 


ct gaggcagc 


:qtgcttcta 


1 1 ct a t cgt t 


1080 


gagagttttg 


t gggagggaa 


cccagacct c 


t cct cccaa c 


caceaGatc" 


caagga t gga 


1140 


gaacaactta 


cccactacc: 


agaat gt t aa 


t ggcagaaga 


gaaaacast a 


aat ca t a 1 1 g 


1200 


act caaaaaa 


aaaaaaa^ag 










1220 



<210> - 
<21l> 2796 
<21Z> DNA 



<213> Home 


sapiens 












<400> 7 














egget cgagg 


gacagga t ga 


ggcccggcct 


ctcatttct c 


ct agecett c 


tgt t ct t cct 


60 


tggecaaget 


gcaggggatt 


tgggggatgt 


gggacctcca 


at t cccagce 


cegget t cag 


120 


ccctttccca 


ggtgttgact 


ccagct ccag 


cttcagctcc 


agct ccaggt 


cgggctccag 


180 


ctccagccgc 


agcttaggca 


gcggaggttc 


tgtgtcccag 


ttgttttcca 


at tt cacegg 


240 


ctccgtggat 


gaccgtggga 


cctgccagtg 


ctctgtttcc 


ctgccagaca 


ccacctt tec 


300 


cgtggacaga 


gtggaacgct 


tggaattcac 


agctcatgtt 


ct 1 1 ctcaga 


agt ttgagaa 


360 


agaactttcc 


aaactgaggg 


aatatgt cca 


attaattagt 


gt gt at gaaa 


agaaact gt t 


420 


aaacctaact 


gt ccgaattg 


acatcatgga 


gaaggatacc 


atttettaca 


ctgaact gga 


4 80 


cttcgagctg 


atcaaggtag 


aagtgaagga 


gatggaaaaa 


ctggt cat ac 


agct gaagga 


54 0 


gagttttggt 


gga a get cag 


aaattgttga 


ccagctggag 


gtggagataa 


gaaat atgae 


600 


tctcttggta 


gagaagct t g 


agacact aga 


caaaaacaat 


gtccttgcca 


tt cgecgaga 


660 


aatcgtggct 


ct gaagacca 


agct gaaaga 


gtgtgaggcc 


tctaaagar. " 


aaaacaccce 


72C 


tgtcgtccac 


cct cctccca 


ct ccagggag 


ct gtggtcat 


ggrggt gt gg 


tgaacat cag 


7 £ : 


caaaccgtct 


gtggtt cage 


tcaactggag 


agggttttct 


tat ctatat c 


gt get tgggg 


8 4 C 


t agggattac 


tctccccacc 


stccaaacaa 


aggactgta t 


tgggtggcgc 


cat tgaat ac 


JOG 


agatgggaga 


et ct tggagt 


a ttatagact 


gt acaacaca 


ctggatgatt 


tget attgt a 


9 6C 


tataaatget 


egagagt tgc 


ggatcaccta 


t ggccaagqt 


agt ggt acag 


cagtttacaa 


1020 


caacaacatg 


t aegtcaaca 


t gtacaacac 


egggaatatt 


gecagagtta 


acctgaccac 


108C 


caacacgat t 




aaact et ccc 


taatgetgee 


tatastaacc 


get ttt cat a 


11 AC. 


tgctaatgtt 


gctt ggcaag 


atattgactt 


tgctgtggat 


gagaat gga~ 


tgtgggtta- 


I2GC 


t t attcaact 


gaagecagea 


ctggt aacat 


ggtgattagt 


aaa etcaatg 


acaccacac: 


I2 6C 


teaggtgeta 


aacacttggt 


a t accaagca 


gt at aaacca 


t ct get t eta 


aegect t ca~ 




ggtatgtggg 




ccacccgt ae 


tatgaacacc 


agaacagaag 


agatttttt a 


13SC 


ctattatgac 


acaaacacag 


ggaaagaggg 


caaact agac 


att gt aatgc 


at aagatgea 


14 4 G 


ggaaaaagt g 


caga gca 1 1 a 


a ct at aacce 


ttttgaccag 


aaactt tat g 


t ctat aacga 


15 00 


tggttacctt 


et gaat t at g 


atcttt ctgt 


ct tgcagaag 


ccccagt aag 


ct gtt tagga 


156C 


gttagggtga 


aaoacaaaat 


gtttgt tgaa 


aaaatagtct 


tct ccactta 


cttagatar. c 


1 £2 C 


tgcaggggt c 


"ctaaaagee 


t gt tea t ttt 


gcagcaatgt 


ttaggtgcat 


agt t ct acca 


166 0 
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cactagagat 


ctaggacatt 


tgtcttgatt 


tggtgagttc 


tcttgggaat 


cat ctgcctc 


1740 


t tcaggcgca 


ttttgcaata 


aagtctgtct 


agggtgggat 


tgt cagaggt 


ctaggggcac 


1800 


tgtgggcct a 


gtgaagccta 


ctgtgaggag 


gcttcactag 


aagect taaa 


ttaggaatta 


1860 


aggaactr aa 


aact cagt at 


ggegtctagg 


gattctttgt 


acaggaaa t a 


t tgcccaat g 


1920 


actagt cct c 


atccatgtag 


caccactaat 


t ctt ecatge 


ctggaagaaa 


cctggggact 


1980 


tagttaggta 


gat taststc 


tggagct cct 


cgagggacca 


aatct ccaac 


ttttttttcc 


2040 


cctcactaca 


cctggaatga 


tgett t at at 


gt ggcagat a 


agt aaattt g 


geatget tat 


2100 


atat tctaca 


t ct gt aaagt 


gctgagtttt 


at ggagagag 


gect ttt tat 


gcattaaat t 


2160 


gtacatggca 


aataaatccc 


agaaggat ct 


gt agatgagg 


cacctqcttt 


ttcttttctc 


2220 


tcattgtcca 


cct tactaaa 


agt cagtaga 


a t ct t ct acc 


tcataacttc 


ctt ccaaagg 


228 0 


cagct cagaa 


ga t t agaacc 


agact t act a 


accaatt cca 


ccccccacca 


accccct t ct 


2340 


actgcctact 


ttaaaaaast 


taatagt ttt 


ct atggaact 


gat ctaagat 


t agaaaaat t 


2400 


aattttcttt 


aa ^i_wC5^c3 


tggactttta 


tttacatgac 


tctaagacta 


taagaaaatc 


24 60 


tgatggcaqt 


gacaaact cc 


tagcatttat 


t gttatctaa 


raaagacctr 


ggagcatatg 


2520 


tgcaacttai 


qagcgiaica 


gtt gttgeat 


gtaat tt tt g 


cct t tgt 1 1 a 


agect ggaac 


2 58 0 


ttgtaagaaa 


atgaaaa::: 


aatt ttt ttt 


t ctaggacga 


gctataoaaa 


agct at t gag 


2640 


agtatctag: 


taatcactgc 


agtagtt gga 


aacct t getg 


g t g t atgtga 


t at get tcta 


2700 


tgctttt gaa 


tgac-:;:a:c 


atctagtctt 


tgtctgtttt 


tec: t t gate 


t z c a a g t c c t 


2760 


agtct atagg 


at: tggcactt 


t aaatget t t 


actccc 






2 7 9 6 


<210> 8 














<211> 2331 














<212> DNA 














<213> Homo 


sapiens 












<220> 














<221> unsure 












<222> (675} 














<400> 8 














tttatcacgg 


get caact gc 


aacaaaacac 


t t cct tgaca 


get ccacaaa 


ctcaggccac 


60 


agtgaggaat 


caacaatatt 


ccacagcagc 


ccaga t gcaa 


gtggaacaac 


accctcatc: 


1 o :" 


gcccact cca 


caaccteagg 


tcgtggagaa 


tctacaacct 


caegcat cag 


t ccaggct ca 


1 £ ' 


actgaaataa 


caacgttacc 


t ggcagtacc 


acaacaccag 


gectcagt ga 


ggcatctacc 


24 0 


acctt ctaca 


gtagccccsg 


atcaccagac 


caaacactct 


cacctgccag 


catgagaagc 


30C 


tccagcac ca 


gtggagaacc 


caccagcttg 


tatagecaag 


cagagt caac 


acacacaaca 


3e: 


gcgttccctg 


ccagcaccac 


cacctcaggc 


ct cagt cagg 


aat caacaac 


1 1 1 ccacagt 


420 


aagccaggct 


caact gagac 


aacactgtcc 


cctggcagca 


tcacaactt c 


atettttget 


480 


caagaattta 


ccacccczca 


tagccaacca 


ggct cagct c 


tgt caacagt 


gt cacct gec 


540 


agcaccacag 


tgccaggcct 


tact gaggaa 


tctaccacct 


tctacagcag 


cccaggctca 


6C-: 


actgaaacca 


cagegtt t tc 


tcacagcaac 


acaatgtcca 


t tcatagt ca 


acaatctaca 


660 


cccttcccrg 


acagnccagg 


cttcactcac 


acagt gt tac 


ctgccaccct 


cacaaccaca 




gacattggt c 


aggaat caac 


agect t ccac 


agcagct cag 


aegcaactgg 


aacaacaccc 




ttacctgccc 


gctccacagc 


ctcagacctt 


gttggagaac 


ctacaacttt 


ct acat cage 


84 0 


ccatccccta 


cttacacaac 


act ctttcct 


gcgagtt cca 


gcacat cagg 


cct cactgag 


90C 


gaatctacca 


ccttccacac 


cactccaagc 


ttcacttcta 


caattgtgt c 


tactgaaagc 


960 


ctggaaacct 


tagcaccagg 


gttgtgccag 


gaaggacaas 


1 1 1 ggaa t gg 


aaaacaatgc 


1C2C 


gtctgtccec 


aaggct aegt 


tggttaccag 


t gcttgtccc 


ctctggaatc 


ct t ccct gta 


10SC 
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gaaaccccgg aaaaactcaa cgccacttta 
ttcacagaaa agatgaatga cgcatcctcc 
aagaatcgga tggatgtcgt tttgaagggc 
attcggagat tgctcaacgg tagcatcgtg 
tacactttag agtatgagga actgtttgaa 
atgaatgaaa ctagaacaac tcttcttgat 
tatagtgaag aggacacttt cgtggattca 
caatgcaccc agaaggctgc cgaaggatat 
gggaagctgg cctgtctgaa caagtgcacc 
ctgggcacat gtcagccgca acgcagtgga 
acactggtac tggggagaga cctgtgaatt 
cgtgggggct gtgatggcgg tgctgctgct 
cctatcccag agaaaacggc acagggaaca 
aggcacccct ggcaticttcc agaagacggc 
cagattcggc cttgagaacg cctacaacaa 
tggcacagag ctccacatcc agaggccgga 
gggcctccca ccctcaicua gctctgtica 
gcctccgggg cgggrcaaga ggaga^' ;;°a 
ctgacagact tggccagtcc cctgcctgtg 
cctctccatc cgggagc.ttc cagactccca 
tggctcccca ctctggaatt tccctaccaa 

<210> 9 

<211> 909 

<212> DNA 

<213> Homo sapiens 

<400> 9 

gaggaggtgg gcgccaacag acaggcgatt 

acagtataca ttgtggacat tcaggacata 

tacctcgatg cctactttgt cttccccaat 

gtgatgatcc ggaanga~ca ggactcgctg 

ctgggctccc aggagagcca ggagt cagac 

ggattgggag tggctttgct gctggtcctt 

cggaagagct acaaccggaa gcttcaagct 

gcagcagggg tgatgccccc agcccctgcc 

cgagccaacc ccatgctgaa cctccccaac 

tccaatgacc tggactctgt cagcgtcaac 

aagaacagtc aggaaatcaa ggagcacagg 

gagcccctga gcgtggtcct gttaggacgg 

ccatcctaca ccaacgcrgg cctggacacc 

tggacccctt gaagaggccc taccacaccc 

tatatgacgc tgccctgcct cctgcttttg 
tattttatt 

<210> 10 
<211> 5iC 
<212> PRT 



ggtat gacag 


t gaaacrt gac 


tiacagaaat 


114 0 


caggaatacc 


agaacttcag 


t accct ctt c 


1200 


gacaatctt c 


ctcagtatag 


aggggt qaac 


1260 


gtcaagaacc 


atg 1 ;catcct 


ggaggcagac 


1320 


aacctggcag 


agattgtaaa 


ggccaagatt 


1380 


cctgattcc*: 


gcagaaaggc 


ca lactgtgc 


1440 


tcggtgactc 


cgggctrtga 


cttccaggag 


1500 


acccagttc: 


actatgtgga 


t gtctt ggat 


1560 


aaaggaacga 


agtcgcaaat 


gaactgtaac 


162C 


cccccgctgc 


ctgtgcccaa 


a ~ a cqaa cac 


1680 


caacatcqcc 


aagagcctcg 




174 0 


cgcattgat c 


atcc;aatc£ 


rcttattcag 


1800 


gt atgatgt g 


c;: caagagt 


ggcgaaagga 


1860 


catctgggaa 


caccagaaic 


t gagggagag 


1920 






ct gt tgact c 


198C 


gatggnagc^ 


tccacrgigt 


gagccaacgg 


2040 


ggagagctgc 


aaacacagaq 


cc ca cca caa 


2100 


gt caggccct 


qaacccggt c 


cigctctgag 


2160 


ctcctgctgg 


ggaaggctgg 


gggctgt aag 


2220 


gaagcctcgg 


cacccctgtc 


tcctcctggg 


2280 


taaaagcaaa 


t ctgaaagct 


c 


2331 



aatgcggctc 


ttacccaggc 


aaccaggact 


60 


gatt ctgcag 


ct cgggcccg 


acctcactcc 


120 


gggt cagccc 


rgacccttga 


tgagct gagt 


180 


acgcagctgc 


t. gcagct ggg 


get ggtggt g 


24 0 


ctgtcgaaac 


agc:catcag 


tgtcatcaca 


300 


gtgatcatga 


ccatggcct t 


cat gt gt gt g 


3 60 


at gaaggctg 


ccaaggaggc 


caggaagaca 


420 


atcccaggga 


ctaacatgta 


caa cact ;ag 


480 


aaagacctgg 


get tggagta 


cctctctccc 


540 


tccctggacg 


acaactctgt 


ggatgt ggac 


600 


ccaccacaca 


ca ccaccaga 


gecaga t cca 


660 


caggcaggcg 


caagtggaca 


ccrggagggg 


7 20 


acggacctgt 


gaeaggggee 


cccact cttc 


78 0 


taact gcacc 


tgt crccctg 


gaga tgaaaa 


84 0 


gccaatcacg 


gcagacaggg 


gttggggaaa 


900 
909 
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<2I3> Homo sapiens 
<400> 10 

Met Arg Pro Gly Leu Ser Phe Leu Leu Ala Leu Leu Phe Phe Leu Gly 
1 5 10 15 

Gin Ala Ala Gly Asp Leu Gly Asp Val Gly Pro Pro lie Pro Ser Pro 
20 25 30 

:|| Gly Phe Ser Pro Phe Pro Gly Val Asp Ser Ser Ser Ser Phe Ser Ser 

M 35 40 45 

Ser Ser Arc Ser Gly Ser Ser Ser Ser Arg Ser Leu Gly Ser Gly Gly 
50 55 60 

; " Ser Val Ser Gin Leu Phe Ser Asn Phe Thr Gly Ser Val Asp Asp Arg 

65 70 75 80 

Gly Thr Cys Gin Cys Ser Val Ser Leu Pro Asp Thr Thr Phe Pro Vai 
85 90 95 

r| Asp Arg Val Glu Arg Leu Glu Phe Thr Ala His Val Leu Ser Gin Lys 

100 105 110 

Phe Glu Lys Glu Leu Ser Lys Val Arg Glu Tyr Val Gin Leu lie Ser 
115 120 125 

Val Tyr Giu Lys Lys Leu Leu Asn Leu Thr Val Arg lie Asp lie Met 
130 135 140 

Glu Lys Asp Thr lie Ser Tyr Thr Giu Leu Asp Phe Glu Leu lie Lys 
145 150 155 160 

Val Glu Val Lys Giu Met Glu Lys Leu Val lie Gin Leu Lys Giu Ser 
165 170 175 

Phe Gly Gly Ser Ser Glu lie Vai Asp Gin Leu Glu Val Glu lie Arg 
180 185 190 

Asn Met Thr Leu Leu Val Glu Lys Leu Glu Thr Leu Asp Lys Asn Asn 
« 195 200 205 

Val Leu Ala He Arg Arg Glu He Val Ala Leu Lys Thr Lys Leu Lys 
210 215 220 

Glu Cys Glu Ala Ser Lys Asp Gin Asn Thr Pro Val Val His Pro Pro 
225 230 235 24C 
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Pro Thr Pro Gly Ser Cys Gly His Gly Gly Val Val Asn lie Ser Lys 
245 250 255 

Pro Ser Val Val Gin Leu Asn Trp Arg Gly Phe Ser Tyr Leu Tyr Gly 
260 265 270 

Ala Trp Gly Arg Asp Tyr Ser Pro Gin His Pro Asn Lys Gly Leu Tyr 
275 280 285 

Trp Val Ala Pro Leu Asn Thr Asp Gly Arc Leu Leu Glu Tyr Tyr Arg 
290 295 30C 

Leu Tyr Asn Thr Leu Asp Asp Leu Leu Leu Tyr lie Asn Ala Arg Giu 
305 310 315 320 

Leu Arg lie Thr Tyr Gly Gin Gly Ser Gly Thr Ala Val Tyr Asn Asn 
325 o30 335 

Asn Met Tyr Val Asn Met Tyr Asn Thr Gly Asn lie Ala Arg Val Asn 
340 345 350 

Leu Thr Thr Asn Thr lie Ala Val Thr Gin Thr Leu Pro Asn Ala Ala 
355 360 365 

Tyr Asn Asn Arg Phe Ser Tyr Ala Asn Val Ala Trp Gin Asp lie Asp 
370 375 380 

Phe Ala Val Asp Glu Asn Gly Leu Trp Val lie Tyr Ser Thr Giu Ala 
385 390 395 400 

Ser Thr Gly Asn Met Val lie Ser Lys Leu Asn Asp Thr Thr Leu Gin 
405 410 415 

Val Leu Asn Thr Trp Tyr Thr Lys Gin Tyr Lys Pro Ser Ala Ser Asn 
420 425 430 

Ala Phe Met Val Cys Gly Val Leu Tyr Ala Thr Arg Thr Met Asn Thr 
435 440 445 

Arg Thr Glu Glu lie Phe Tyr Tyr Tyr Asp Thr Asn Thr Gly Lys Giu 
450 455 460 

Gly Lys Leu Asp lie Val Met His Lys Met Gin Glu Lys Val Gin Ser 
465 470 475 480 

lie Asn Tyr Asn Pre Phe Asp Gin Lys Leu Tyr Val Tyr Asn Asp Gly 
485 490 495 
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Tyr Leu Leu Asn Tyr Asp Leu Ser Val Leu Gin Lys Pro Gin 
500 505 510 



if 



10 
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